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We study the game of Dots-and-Boxes from a statistical point of view. The early game can be 
treated as a case of Random Sequential Adsorption, with a jamming transition that marks the 
beginning of the end-game. We derive set of differential equations to make predictions about the 
state of the lattice at the transition, and thus about the distribution of avalanches in the end-game. 



Real-life games have traditionally inspired research 
by mathematicians, economists, psychologists, computer 
scientists and also physicists, starting with the develop- 
ment of probability theory to handle games of chance 
Jl], ||. Later fields of research were economic game 
theory [Q IE), where usually two or more players have 
to make simultaneous and independent decisions with- 
out knowing what the other player is going to do, and 
combinatorial game theory || R], where players take al- 
ternating turns, and all information on the state of the 
game and the possible future moves is available. 

The popular children's game of Dots-and-Boxes falls 
into the latter category. The game is played on a rect- 
angular lattice (a checkered sheet of paper), and players 
take alternating turns. At each turn, the active player 
occupies an edge. If he thus occupies the fourth edge of 
at least one of the two adjacent squares, the player con- 
tinues the turn by placing another edge. Rules vary on 
whether a player must take a square if he has the chance 
- the option not to take it allows for a number of subtle 
moves (for a guide to the end-game of Dots-and-Boxes, 
see Ref. Q, Vol. 2); however, for simplicity we demand 
that any chance to take a square must be used. The 
game ends when all edges and squares are occupied. The 
player who took more squares wins. 

The game can be separated into two distict phases: in 
the early game, players usually occupy edges more or less 
at random. However, they avoid placing the third edge 
around any square, which would give the opponent the 
opportunity to score. Phase 1 ends when this is no longer 
possible: all free edges have at least one adjacent square 
with two occupied edges. This situation is analogous 
to a jamming transition in models of random sequential 
adsorption (RSA). The main focus of this paper is the 
modeling of the early game, using methods from RSA 
theory and Monte Carlo simulations, and to determine 
the time and state of the game at the jamming transition. 

In the end game, squares are rapidly filled: each edge 
creates an opportunity to score, which often triggers an- 
other opportunity, and another, until the avalanche is ter- 
minated somehow. The end-game is largely determined 
by nonlocal strategies (like figuring out what the short- 
est possible avalanches are), and thus not accessible to 
the methods used for the early game. However, the state 



of the game at the beginning of the end-game limits the 
options available for the rest of the game, and allows for 
predictions of the minimum number of turns until the 
end of the game, the average size of avalanches, and so 
forth. 



I. THE MODEL 

As a model for the game that can be both simulated 
with reasonable computational effort and handled ana- 
lytically, we use the following rules of behaviour for the 
players: 

1. Occupy squares if possible: in accordance with the 
rules that prescribe a greedy strategy, the active 
player searches the board for all free edges with an 
adjacent square with exactly three occupied edges 
around it, picks one of them at random, occupies 
it, and continues his turn. This is repeated as often 
as possible. 

2. Create no opportunities for the opponent: if there 
are no squares to be taken, the player looks for 
free edges whose adjacent squares both have less 
than two occupied edges. One of those is picked at 
random and occupied, and the turn ends. 

3. Minimize the opponent's score: if both prescrip- 
tions (1) and (2) yield no suitable edge, the ac- 
tive player has to pick the third edge around some 
square, thus giving his opponent a chance to take 
it according to (1). He checks all edges to see how 
many points his opponent would gain from them. 
He picks one of those that give away the smallest 
number of points, occupies it, and ends the turn. 

To get rid of boundary effects, the lattice on which 
the game is played is assumed to have periodic bound- 
ary conditions. Furthermore, the analytical treatment is 
strictly valid in the limit of infinite lattice size only. 

The lattice has N squares. Accordingly, the number 
of edges is 2N. The number of turns is counted by T. 
The rescaled time t = T/(2N) runs from to at most 1; 
however, since more than one edge per turn is occupied in 
the end-game, the game ends at times t < 1. Two other 
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FIG. 1: Free edges are classified according to the number of 
occupied edges in adjacent squares. In all cases shown here, 
the central edge (dashed line) has the indices 21. All possible 
configuration with the same index ij are assumed to have 
equal probabilities. 



useful quantities to describe the system are the number P 
of occupied edges (or p = P/ (2N), the probability that a 
given edge is occupied), and the number of filled squares 
S (or s — S/N, respectively). 



II. THE EARLY GAME 

As mentioned in the introduction, the early game can 
be treated as a special case of random sequential ad- 
sortion (RSA) |^|]. The basic idea of RSA is to deposit 
particles (atoms, dimers, or, in our case, edges) on ran- 
domly chosen sites of a surface (often on a regular lattice) 
unless this deposition violates restrictions posed by par- 
ticles that were adsorbed before. In Dots-and-Boxes, the 
edges form a square lattice with a peculiar short-range 
three-particle repulsion. 

The usual procedure to treat RSA problems analyt- 
ically is to solve a set of coupled differential equations 
that describe the probability of encountering the various 
possible configurations of particles (for details, see ||). 
Unfortunately, this set of ODEs is generally not closed, 
i.e., the equations for small configurations include prob- 
abilities of larger configuration, which in turn depend on 
still larger configurations. At some point, one has to ne- 
glect correlations and truncate the equations. Since the 
number of terms increases dramatically with increasing 
order of truncation, we will use the simplest approxima- 
tion that still captures the interaction correctly. 

This approximation characterizes each free edge by two 
indices - the first index i for the number of occupied 
edges surrounding the adjacent square above or to the 
left of the considered edge, the second index j for the 
occupation number of the square below or to the right 
(for example, Fig.|l] shows all configurations with index 
21 around a horizontal edge). One can then count the 
number of free edges with indices ij, and determine 
their density /y = F lj /(2N). 

All possible configurations with the same indices are 
now assumed to have the same probability, and correla- 
tions beyond nearest neighbours are neglected. For ex- 
ample, if we consider a free edge with indices 11, we 
have no information on the free edges surrounding its 
adjacent squares, except that one of their indices must 
be 1. Accordingly, we assume that the other index 
(let us say, j) follows a simple conditional probability, 
Prob(j|i) = /y/Ek/i*- 



In the inititial phase of the game, there is an extensive 
number of edges that can be occupied without giving the 
opponent an opportunity to score, following prescription 
(2) of the strategy. These are edges from the categories 
00, 10, 01, and 11. Now we can see what happens to the 
fij if an edge is occupied, i.e., a time step dt = 1/(2N) 
elapses. The chosen edge (let us say, it has indices kt) is 
occupied, and fu is decreased by dfki — 1/(2N). This 
means that for all edges that can be chosen, the differ- 
ential equation for fki includes a loss term equal to the 
probability that an edge with index kl is picked: 



fkl 
ff 



(1) 



where // is the density of edges that can be occupied 
before the transition, 



// — /oo + /oi + fio + fi 



(2) 



The free edges in the squares next to the chosen edge 
must be updated: the index corresponding to the con- 
sidered square is increased by one. For instance, assume 
that the upper/left adjacent square of the chosen edge 
has occupation number (which happens with probabil- 
ity (/oo + foi)/ ff)- Two of the three remaining edges 
around that square will then have their second index 
increased by one, whereas one will have its first index 
incremented. This leads to gain and loss terms in the 
ODEs: 



fiO 
foi 
fil 
fit 



2 fio foo + /oi 

J2k=0 ho ff 
foi /oo + ,/bi 

J2k=ofok ff 

2 fio foo + foi 
J2k=o ho f f 
foi foo + ,/bi 

J2k=ohk ff 



(3) 



What follows is a rather tedious summation of terms for 
the possible combinations of indices. 

The equations derived in this fashion can be simpli- 
fied considerably by assuming the symmetry fij = fji, 
thus keeping only the categories with i > j. With the 
abbreviations 

ro = (/oo + /io)/(/oo + fio + ho); (4) 
n = (/io + /ii)/(/io + /n + /2i), (5) 

the resulting system of differential equations looks as fol- 
lows: 



dfoo 
dt 

dfio 
dt 

dfn 
dt 



{-foo - 6/oofo)///; 

(-/io + 3/oofo ~ 3/i r - 2/ion)///; 

(-hi + 6/ioro - 4/nn)///; 



3 




FIG. 2: Comparison between the numerical solution of the 
ODEs and a simulation with N — 400 x 400. 



= (-3/ 20 ro+2/ 10 ri)// / ; 
^ = (-2/2ir 1 +2/ u n+3/ 20 ro)///; 
^§ = ihin/ff. (6) 

This can be solved numerically and compared to simula- 
tions. The agreement is very good, but becomes slightly 
worse close to the jamming transition - the point where 
// becomes zero and the early game ends. The accuracy 
could probably be improved by including probabilies of 
larger configurations; however, it is not worth the effort, 
since the calculated numbers agree with results from real 
play only in the order of magnitude anyway (see Section 
|V|). The predictions and numerical values for the jam- 
ming time tj and the order parameters at that time are 
given in Table [j| 





theory 


simulation 


tj 


0.4615 


0.4657 


fll 


0.3244 


0.3409 


/» 


0.0901 


0.0846 


/20 


0.0169 


0.0121 



TABLE I: Order parameters at the jamming transition for 
the square lattice. 



III. THE END-GAME 

When the last edge from the categories 00, 01, 10, and 
11 has been taken, prescription (2) is no longer an op- 
tion. The game now alternates between prescriptions (1) 



and (3): Each player's turn begins by filling the squares 
of the avalanche that his opponent has offered him by 
placing the third edge around some square (prescription 
(1)). When all possible squares have been taken, the 
active player now determines the avalanche that his op- 
ponent must take (prescription (3)). Since the length 
of the avalanche triggered by placing an edge is not a 
local property of that edge (and highly correlated to 
that of neighboring edges), a description by differential 
equations analogous to the early game makes little sense. 
However, since the state of the system at the transition 
largely determines the options of the players later on, we 
can make quantitative predictions about the end-game 
from the knowledge gained in Sec. Q. 

Fig. H shows the state of a game with 15x15 squares 
at the jamming transition. The squares are segments of a 
tunnel if they have two occupied egdes. They represent 
tunnel branchings (or single defects, marked by empty 
circles) if they have only one edge, and tunnel crossings 
(or double defects, marked by full circles) if they have 
none. 

Thus, three edges from the /21-category form a single 
defect, whereas four /20-edges make up a double defect. 
The density of defects can be calculated directly from the 
order parameters at the transition. 

As an additional check, one can make sure that the 
total number of edges (both occupied and free) add up to 
2N, and that the number of squares (tunnels and defects) 
add up to N. This leads to the following equations: 

t.j + 2/ 20 + 2/ 21 + / 22 = 1; (7) 
3/20 + (10/3)/ 2 i + 2/22 = 1, (8) 

which arc fulfilled for both the analytical and the exper- 
imental values. 

This enables us to make some statements about the 
avalanches or chains of occupied squares that occur in 
the late game. An avalanche started in a tunnel fills 
the tunnel and ends at the defect edges on both sides 
of the tunnel. Thus, with 4,N{f 2 \ + /20) defect edges, 
there are at least Na = 2N{f 2 \ + /20) different poten- 
tial avalanches at the time of the transition. (For now, 
we neglect avalanches in closed areas and other compli- 
cations.) We can also calculate the number of tunnel 
segments (squares with occupation 2) from the order pa- 
rameters: N T = N - (4/3)iV/ 2 i - Nf 20 . This yields an 
average length of the tunnel segments of Nt/Na ~ 4.5 
for the values of from the simulation, as given above. 

Note that in an analogy to the "waiting time para- 
doxon" , the average avalanche length becomes larger 
than the mentioned value of 4.5 if avalanches are started 
at randomly chosen edges rather than randomly chosen 
tunnel segments, because longer tunnels include more 
edges and are thus chosen with higher probability: 

Let us assume that the probability distribution P av (l) 
of tunnel lengths I follows an exponential with a decay 
constant l/l*. Since I > 1, the normalization constant is 
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FIG. 3: A typical state of the system at the jamming tran- 
sition. Single defects are marked with empty circles, double 
defects with full circles, and shading denotes a closed area. 
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FIG. 4: Probability distribution P e d(0 °f avalanche lengths 
I at the jamming transition, averaged over randomly chosen 
free edges. Simulations average over 40 runs with N = 50 x 50. 



of the game t E should be at least 



exp(l/Z*) — 1, and the average value is 

oo ^ 

{ l )av = J2Ke 1/l * -De' 1 ' 1 * = T — IJF . (9) 
;=i 

With the mentioned value of {l) av — 4.5, one gets I* as 
4.0 and P av (l) « 0.284 exp(-Z/4.00). Since each tunnel 
avalanche of Z squares length has I + 1 edges where it 
can be started, the probability distribution of avalanche 
lengths averaged over free edges follows the form P e d{l) oc 
(Z + 1) exp(-Z/Z*). 

This agrees fairly well with simulations, as seen in Fig. 
|]. However, there is a preference for even avalanche 
lengths, which can partly be explained with the presence 
of closed areas. These are areas that contain no defects 
and are separated from the rest of the board by occupied 
edges. They include an even number > 4 of squares. The 
probability of an edge being in a closed area of size Z is 
shown in Fig. |I] (open circles). Even if it is taken into 
account, even avalanches are more likely, for reasons that 
are still unclear. 

The density of defects at the transition allows for a 
prediction of the total number of turns in the end-game: 
When an avalanche is terminated at a defect, that de- 
fect is turned into a tunnel segment (if it was a single 
defect before) or into a single defect (if it was a dou- 
ble defect). Therefore the number of defect-terminated 
avalanches Ndta is half the number of single defects plus 
the number of double defects: 

Ndta = ((2/3)/ 21 + fo)/N « 0.0697V. (10) 

This means that the time difference from tj to the end 



t E - tj > N DTA /(2N) = 0.034. (11) 

However, this is really only a lower bound on the time 
found in simulations, t E — tj ~ 0.054. The reason 
for the deviation is the existence of avalanches that do 
not change the number of defects, namely, avalanches 
in closed areas. These areas are not necessarily present 
at the beginning of the end-game. Instead, they may ini- 
tially be half-closed areas: areas that are not quite closed, 
but connected to the rest of the system by a single de- 
fect. Depending on whether the avalanche is started in 
the tunnel outside the half-closed area or inside it, it is 
either turned into a closed area, or it is filled, the defect is 
removed, and the adjacent tunnel is filled as well. Since 
it is usually desirable to give the opponent as few points 
as possible, most of the half-closed areas in real play will 
be turned into closed areas, and then filled. 

Apart from these exceptions, avalanches tend to get 
dramatically longer as the end-game goes on (see Fig. 
||). This is due to two effects: first, small avalanches 
are triggered earlier than larger ones due to prescription 
(3), and thus removed; second, avalanches that stop at a 
single defect turn it into a tunnel segment, merging two 
potential avalanches into one. 



IV. OTHER LATTICES 

The game can be played on lattices other than the 
square, as long as there is a notion of an edge separating 
two cells, and placing a single edge is not enough to make 
scoring possible. The simplest case is the triangular lat- 
tice, where there are only single defects, and the relevant 
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FIG. 5: Time development of fraction of occupied edges (p) 
and squares (s) in a simulation on an 80 x 80 lattics. 



FIG. 6: Comparison between the analytical solution (Eqs. 
(E^)-(E3)) of the ODEs and a simulation on a triangular lat- 
tice with N = 20000. 



order parameters in the early game are /oq, /iq, and fn. 
The corresponding differential equations are 



dfoo 
dt 

dfw 
dt 

dhi 
dt 



-1-4 



JO' 



Joo 
foo — J 10 , 
Joo + /io ' 

fio 
foo + ho 



(12) 
(13) 
(14) 



They can even be solved analytically by introducing a 
rescaled time r with dr = (foo + fio)~ 1 dt, and solving 
the remaining system of linear ODEs in r with constant 
coefficients. One gets 



/oo(r) 


= e~ iT (2-e T )- 


(15) 


/io(r) 


= 2e- 4r (e r -l); 


(16) 


/h(t) 


= 2e" 4r -(8/3)e- 3r + 2/3. 


(17) 



Using Eqs. (|l|) and ©, i can be calculated: 
t{r) = (1 - e- 3T )/3; r(t) = -(1/3) ln(l - it). 



(18) 



Again, agreement between theory and simulation is very 
good, as seen in Fig. ^. The predicted and observed val- 
ues for the jamming transition and the order parameters 
are given m Table || 





theory 


simulation 


tj 


7/24= 0.291667 


0.2930 ± 0.0002 


ho 


1/8 =0.125 


0.1209 ±0.0003 


hi 


11/24 = 0.45833 


0.4655 ± 0.0002 



TABLE II: Order parameters at the transition for the trian- 
gular lattice 



Other possible lattices include hexagonal and three- 
dimensional cubic lattices. In the latter case, edges cor- 
respond to faces of unit cubes. Interestingly, the differ- 
ential equations are the same for both the hexagonal and 
the 3D-cubic lattice, since both have six edges/faces sur- 
rounding each hexagon/cube. Although structurally sim- 
ple, the equations involve fifteen order parameters and 
are not written out for the sake of brevity. 

Of course, the whole range from single to quadruple de- 
fects can occur; however, multiple defects are rarer than 
single ones, as seen in Table flE 





theory 


simulation 3D 


simulation hexagonal 


tj 


0.6367 


0.6381 ±0.0001 


0.6351 ±0.0001 


ho 


0.00017 


0.00010 ± 0.00003 


0.00030 ± 0.00003 


hi 


0.0027 


0.0022 ± 0.00001 


0.0036 ± 0.0001 


fi2 


0.0177 


0.0162 ±0.0002 


0.0198 ±0.0002 


/« 


0.0577 


0.05723 ± 0.0002 


0.0579 ± 0.0002 


fa 


0.2064 


0.2105 ±0.0002 


0.2018 ±0.0002 



TABLE III: Order parameters for the 3D cubic and hexagonal 
lattice. 



Results from simulations confirm the picture predicted 
by calculations, with the usual deviations on the order of 
IO" 3 . 

Is the game still interesting on other lattices? Disre- 
garding the practical difficulties of playing on a 3D-cubic 
lattice, all basic mechanisms of the game still work, in- 
cluding closed and half-closed areas. A rough estimate 
shows that the initial avalanche length (averaged over 
possible avalanches) is 4.8 for the triangular lattice and 
3.6 for the 3D-cubic and hexagonal lattice, similar to the 
square lattice. We therefore expect that real-life games 
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FIG. 7: Comparison between a simulation with AT = 80 x 80 
and a real-life game on a 10 x 10-lattice. To avoid confusion, 
only the curves for foo, fio, fu, and the density of filled 
squares p is shown in this figure - see also Fig. ^. 

on other lattices would not be much different from Dots- 
and-Boxes on regular square lattices. 

V. COMPARISON TO REAL PLAY 

We let some coworkers play a computer version of 
Dots-and-Boxes (with periodic boundary conditions and 
N = lOx 10) to see if their style of play is well described 
by the assumptions in Section I. Generally speaking, hu- 
man players did not place edges at random in the early 
game; instead they tended to add edges to existing struc- 
tures. In some cases, this led to a significantly lower 
number of defects, and thus longer avalanches. One pair 
of players chose to get rid of the periodic boundary con- 
ditions by drawing a frame around the board early in the 
game. 

Nevertheless, some games showed quantitative similar- 
ities to our theoretical predictions. The order parameters 
from one of these games is shown in Figs [?] and [| 

Our test players did not try tactical subtleties like ad- 
justing the number of avalanches in order to get the last 
(and presumably longest) one. They were usually happy 
if they avoided blatant mistakes. 

Of course, one could include human tendencies and 
extend the model to include cooperative sequential ad- 
sorption H|, where edges are preferrably placed next to 
edges occupied before. However, since this could not de- 
scribe all human players with the same set of parameters 
and would probably give no qualitative new insights, the 
usefulness of this extension is questionable. 



FIG. 8: The remaining order parameters of the game shown 
in Fig. |. 

VI. SUMMARY AND CONCLUSION 

We gave a statistical treatment of the game of Dots- 
and-Boxes, using some simplifying assumptions for the 
behaviour of the players. In the early game, since a finite 
fraction of edges can be chosen, a mean-field descrip- 
tion given by a system of coupled differential equations 
works well. It makes predictions about the point where 
avalanches start and the degree of geometrical frustra- 
tion at that point. The same scheme works for all kinds 
of regular lattices; the relevant quantity is the number 
of edges or faces around a cell, such that hexagonal and 
3D-cubic lattices are described the same equations. 

These predictions allow for statements regarding the 
statistics of avalanches, as well as the total number of 
turns in the end-game. The presence of closed and half- 
closed areas makes the situation more complicated; un- 
fortunately, they cannot be captured by the approxima- 
tions made in the calculation of the early game. 

While results from calculations give good agreement 
with simulations, human players have various habits that 
cannot be easily included in an all-encompassing mean- 
field treatment ("I like making corners. They look nice.") 
Thus, while our analysis has yielded some insight in 
the underlying processes of Dots-and-Boxes, quantitative 
agreement with human play is not always satisfactory. 
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